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1 .  Introduction  and  Summary 

The  object  of  this  paper  is  to  develop  statistical  signal  detection 
techniques  for  a  wide  class  of  stochastic  processes  with  stationary 
independent  symmetric  increments.  That  is,  the  basic  premise  in  this 
situation  is  that  the  increments  from  a  regularly-sampled  continuous 
parameter  process  are  i.i.d.  with  a  continuous  distribution  satisfying: 
F(x)  +  F(-x)  =  1  for  all  x.  Taking  symmetry  about  zero  is  no  loss  of 
generality,  since  one  can  initially  take  symmetry  amount  some  constant 
c  and  then  one  can  re-define  the  process  to  be  symmetric  about  zero. 

Let  (Z(t),  t  ^  0}  be  a  stochastic  process  of  this  type,  then  the 
realized  data  from  signal  detection  viewpoint  will  be  of  the  following 
two  types : 

(a)  Historical  Data:  =  (Xj,  X2>  X^) ,  where  Xf  = 

Z(rA)  -  Z((r  -  1)A),  Z(0)  =  0  and  A  >  0. 

(b)  Two-sample  Data:  (0C,Y)  =  (X^  X2,  ...,  Xm;  Yj,  Y2>  ...,  Yn) , 
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Xj's  and  Y^'s  are  defined  similar  to  (a)  above  with  respect  to 
stochastic  processes  (Z(t),  t  >_  0  with  law  }  and  (Z'(t),  t  >  0 

Li 

with  law 

The  family  of  all  stochastic  processes  with  stationary  independent 
symmetric  increments,  in  the  sequel,  will  be  denoted  by  9.(SISI) .  Three 
types  of  detection  problems  will  be  treated.  These  are  outlined  below 
in  terms  of  pure  noise  (PN)  and  noise  plus  signal  (N  +  S) ;  see  Bell 
(1964a) . 

(i)  Goodness-of-fit  Detectors.  Here,  we  detect  the  problem  of 
PN:  o£  =  e  fi(SISI)  against  N  +  ,  where  is 

completely  specified. 

(ii)  Class-fit-Detectors .  This  involves  detecting  PN:  e  Sl(SISI) 

versus  N  +  S:  £  ft(SISI) . 

(iii)  Two-sample  Detectors .  This  problem  is  to  detect 
PN:  =*2  e  ^(SISI) ,  i  =  1,  2)  against  N  +  S:  *  *V 

where  again  ^i  and  ^2  are  *n  ^(SISI) . 

Let  a  denote  the  probability  that  the  detector  will  produce  a  false 
alarm  (PFA) ;  and  denote  by  B  the  probability  that  the  detector  will 
produce  a  false  dismissal  (PFD) .  The  detectors  proposed  in  this  in¬ 
vestigation  are  optimal  in  the  sense  that  for  a  fixed  PFA  a,  the 
procedure  has  minimum  PFD,  B-  The  organization  of  the  paper  is  as 
follows . 

In  Section  2  some  examples  of  laws  from  the  family  f2(SISI)  are 
given.  Section  3  starts  with  some  basic  terminology  used.  Also,  dis- 
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cussion  about  minimal  sufficient  statistics  (M-S-S) ,  maximal  statistical 
noise  (M-S-N)  and  a  non-parametric  property  for  the  underlying  detection 
statistics  is  given.  In  part  4  relevant  distributions,  permutations 
and  alternative  statistics  with  some  examples  are  discussed.  Section  5 
treats  goodness-of-fit  detectors.  In  the  next  section  6  class-fit 
detectors  are  developed.  The  two-sample  detectors  are  dealt  with  in 
section  7.  The  final  part  concludes  with  point  estimation  and  confidence 
bounds  for  F  in  fi(SISI)  which  are  quite  useful  in  detection  methodology. 


For  various  structural  results  and  properties  in  the  case  of 
stochastic  processes  belonging  to  the  class  ft(SISI),  the  relevant 
references  are  Doob  (1953),  Feller  (1966)  and  Basawa  and  Rao  (1980).  A 
different  but  related  concept  of  symmetry  is  investigated  by  Bell  and  Haller 
(1969),  Bell  and  Smith  (1969,  1972),  and  Ahmad  (1974).  Wiener-Levy  pro¬ 
cesses  which  are  closely  connected  with  processes  of  the  class  fl(SISI) 
are  treated  in  Bell  et  al.  (1980).  The  detection  techniques  developed 
in  this  paper  carry  over  to  discrete  time  as  well  as  continuous  time 
parameter  stochastic  processes  in  the  family  fl(SISI) . 

2.  Some  Examples  of  Laws  from  (SISI) 

Example  2.1.  (Wiener).  Let  {Z (t) :  t  >_  0}  be  a  Gaussian  process 

satisfying  (a)  Z(0)  =  0;  (b)  E (Z (t) )  =  0,  (c)  Cov  (Z(s),  Z(t))  =  0 

2 

min  (s,t).  Then  for  any  A  >  0,  Y^,  . . . ,  are  i.i.d.  N(0,  0  A), 

when  Yj  =  Z(jA)  -  Z([j  -  1]A).  That  is  the  process  has  SISI's,  or 
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Example  2.2.  (Compound  Poisson).  Let  Y^,  Y2,  . . . ,  Y  ,  ...  be  i.i.d. 

'Cr(_9>0);  (N(t) :  t  >_  0}  be  a  H-P-P  (Homogeneous  Poisson  Process) 

N  ft ) 

with  parameter  X;  and  Z(t)  =  l  Y..  Then  (Z(t) :  t  _>  0}  is  a 

1  -1 

Compound  Poisson  Process.  Further,  for  each  A  >  0,  Y^,  ...,  Y^  are 
i.i.d.  F*,  with  F*(x)  +  F*(-x)  =  1  for  all  x,  when  Y^  =  Z(iA)  - 
Z([j  -  1]A). 


Example  2.3. 

exponential , 
Then  (Z(r): 


(Random  Walk) . 

D  -  E(A) ,  i .e . , 
r  =  1,  2,  . . . } 


Let  Wj ,  W^,  .  ..,  Wn>  ...  be 
^ (x)  =  j  e"X^  ;  and  Z(r) 
has  a  law  ^  in  fi(SISI). 


i.i.d.  double 


Example  2.4.  (Symmetric  Stable  Distributions).  A  distribution  F(x)  is 

said  to  be  strongly  unimodal,  if  and  only  if,  the  convolution  of  G 

with  any  unimodal  distribution  is  unimodal  (normal  and  Wishart  distributions 

are  strongly  unimodal) .  If  F  and  G  are  symmetric  about  zero  and 

unimodal  distributions,  then  so  is  their  convolution  F*G.  Furthermore, 

all  stable  distributions  (as  defined  below)  with  characteristic  functions 

given  by  exp  ( - 1 1 1  ) ,  0  <  a  <_  2  are  unimodal.  Let  X,  X^,  ...  denote 

mutually  independent  random  variables  with  a  common  distribution  H 

and  set  X*  =  X,  +  X.  +  ...  +  X  .  The  distribution  H  is  stable  if  for 
n  1  2  n 

d 

each  n  there  exist  constants  b  >0  and  c  such  that  X*  =  b  X  +  c 

n  n  n  n  n 

and  H  is  not  concentrated  at  the  origin.  Consequently,  if  X^,  X£,  ... 
are  i.i.d.  are  stable  and  symmetric  about  zero,  then  {Z (t)  =  X*, 
r  =  1,  2,  ...}  has  a  law  <7^  in  fi(SISI). 
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Remark  2.1.  Stable  distributions  are  natural  generatizations  of  the 

1  /  cl 

normal  family.  Only  the  norming  constants  bn  =  n  are  possible; 
a  is  called  the  characteristic  exponent  of  the  distribution  H(-). 

All  stable  distributions  are  continuous.  For  many  applications  and 
other  results  for  stable  distributions  see  Feller  (1966) . 

3.  Sufficient  Statistics  and  the  Non-parametric  Property 

In  developing  the  statistics  to  be  used  it  is  convenient  to  delineate 
two  types  of  distribution-free-ness . 

Definition  3.1.  (a)  A  statistic  T(*)  is  NPDF  wrt.  a  family  51' 

of  stochastic-process  laws,  if  there  exists  a  cpf  Q(*)  such  that 

P{T$)  <  t\ <X  )  =  Q(t)  for  all  t,  and  for  all  ^  e  51'. 

% 

(b)  A  family  {T*(*  e  52'}  is  PDF  wrt  a  family  52', 

if  there  exists  a  cpf  Q*  such  that 

P  T*(^;<?f)  t|c^}  =  Q*(t)  for  all  t  and  for  all  e  52'. 

Example  3.1.  Consider  Example  2.1  with  52'  =  {WLP:  a  >  0}  and  data 

4  9-1 

Y  =  (Y  ...,  Y  ).  Let  T(Y)  =  j  [  f  y?]  [  f  y2.]  and 
Of1  y  'V;  H  j  J  5  J 

9 

T*(Y,  X)  -  °~2  [  X  y-JA*1.  Then  T(-)  is  NPDF  wrt  52',  with 

*\j  i  D 

Q  =  F(4;  5),  and  is  PDF  wrt  52'  with  Q*  =  Xg- 

The  interest  here  is  in  the  wider  family  52  (SISI).  In  order  to  con¬ 
struct  the  PDF  and  NPDF  statistics,  one  needs  the  minimal  sufficient 
statistic  (M-S-S)  and  its  complementary  statistic,  the  maximal 
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statistical  noise,  M-S-N,  to  be  defined  below. 

Notation.  Let  X  =  (X- ,  . .  . ,  X  )  be  the  vector  of  increments,  i.e.. 

J.  n 

X  =  Z(jA)  -  Z([j  -  1] A) ;  let  S  (X)  =  (|x|(l),  ...,  |x|(n)),  the  vector 

J  -L  % 

of  ordered  absolute  values;  let  N, (X)  =  (e,  R*) ,  where 

I  %  Oi  Oi 

e  =  [£(X  ),  e(X  ),  ...,  e  (X  )],  and  R*  =  [R(|X.|),  ...,  R(|xJ)]; 
f\j  x  £,  ii  ^  x  n 

and  fijfX)  =  [Sj  (X)  ,  Nx  qC)  ]  . 

Theorem  3.1.  (a)  S,  (X)  is  the  M-S-S  for  fl(SISI) ,  and  it  is  complete. 

l  rv 

(b)  S  (X) ,  e  and  R*  are  mutually  independent 

1  a.  'x,  % 

(c)  is  1-1  a.e. 

Definition  3.2.  Let  S(X)  be  a  M-S-S  for  fi' ;  S(X)  =  fS(X),  N(X)1 

%  <\j  %  f\j 

be  1-1  a.e.,  and  N(X)  be  independent  of  S(X).  Then 

'Xi  % 

(a)  6(*)  is  called  the  BDT  (basic  data  transformation)  for 

and 

(b)  N(X)  is  called  the  M-S-N  (maximal  statistical  noise)  for  ft’ . 

a, 

Theorem  3.2.  (a)  S^X)  =  [Sj  (X) ,  N^)]  is  the  BDT  for  ^(SISI); 

(b)  N. (X)  =  [e,  R*]  is  M-S-N  for  Q* . 

i  ^  O/  'V 

The  general  rule  for  employing  these  statistics  in  signal  dection 
is  as  follows. 

Rule  of  Thumb.  (A)  For  situations  involving  a  specific  law,  (e.g., 

PN:  &(  =  (?Cq)  >  employ  a  PDF  statistic  based  on  the  M-S-S, 
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(B)  For  situations  involving  the  underlying  structure  of  the  family 
e.g.,  PN:  X  e  or  PN:  X ^  =  X ^ ,  employ  an  NPDF  statistic  based 
on  the  M-S-N. 

This  overall  principle  will  be  followed  in  the  sequel.  However,  in 
order  to  make  efficient  use  of  this  principle,  one  should  consider  several 
alternate  versions  of  the  M-S-S  and  the  M-S-N. 

4.  Distributions,  Permutations  and  Alternate  Statistics. 

The  only  versions  of  M-S-S  and  M-S-N,  known  to  the  authors,  are 

those  related  to  maximal  invariants  or  oermutation  statistics.  S..  (X) , 

i  % 

given  above,  is  an  appropriate  maximal  invariant,  as  is  N,  (X) . 

1 

The  set  of  permutations  of  interest  here  is  the  Sign-Time  grout), 

S*. 

n 

Definition  4.1.  (a)  S*  =  {all  permutations  of  coordinates  of  X  and 

-  n  r\j 

changes  of  signs  of  coordinates.} 

(b)  S*(X)  =  {y(X) :  y  e  S*  }  is  the  S*  orbit  of  X. 

n  *\,  Oj  n  n 

Example  4.1.  Let  n  =  2,  and  x  =  {-5.6,  0.9)  then,  its  orbit  is 
S2($  =  {y(£>:  Y  £  S2} 

=  {(-5.6,  0.9),  (-5.6,  -0.9),  (5.6,  0.9),  (5.6,  -0.9), 

(0.9,  -5.6),  (-0.9,  -5.6),  (0.9,  5.6),  (-0.9,  5.6)}. 

Theorem  4.1.  (a)  S*  is  a  group  of  order  (n!)(2n) 

(b)  S*  is  a  wreath  product  group. 

(c)  The  orbit,  S*(X)  contains  (n!)(2n)  points  for  almost 

n 
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every  X  • 

a, 

Based  on  these  permutations,  one  can  now  give  useful  additional 
versions  of  the  M-S-S  and  the  M-S-N. 

Theorem  4.2.  S>_  (X) ,  the  S*  orbit  of  Definition  4.1,  is  a  M-S-S. 

—  XI  n 

In  order  to  construct  the  M-S-N  based  on  S*,  one  needs  the  following 
definitions 

Definition  4.2.  (a)  A  (measurable,  real-valued)  function  h  (•)  is 

called  a  B-Pitman  function  wrt  a  set.  S',  of  permutations,  and  a 
family  of  stochas ti c-process  laws  if 

P(h (X)  =  h(y(X)) |  }  =  0,  unless  X  =  y(X), 

%  %  %  0/ 

for  all  permutations  y  in  S'  and  all  oC  in  O' . 

(ii)  Let  R(h(X))  =  T  e  (h(X)  -  h(y(X))  ,  where 
-A  yeS’  ^  , 

e(u)  =  1  if  u  >1  0;  =0,  if  u  <  0.  Then,  if  h(-) 

is  a  B-Pitman  function  wrt  S'  and  0' ,  R(h(-))  is  called  a  B-Pitman 

statistic  wrt  S'  and  O'. 

Example  4.2.  Let  n  =  3,  and  h(x^,  x^,  x^)  =  x^  +  2x2  +  3x^.  Then, 
h(-)  is  a  B-Pitman  function  wrt  S*  and  0(SISI).  Further,  R(h(-)) 
is  a  B-Pitman  function  wrt  S*  and  Q(SISI). 

One  can  prove  immediately 

Theorem  4.3.  (Maximal  Statistical  Noise  Theorem)  (a)  If  h(-)  is 
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a  B-Pitman  function  wrt  S*  and  f2(SISI),  then  R(h(*))  is  (a  version 
of)  M-S-N. 

(b)  T*  is  NPDF  wrt  £2(SISI)  iff  there  exist  a  (measurable) 
function  W(*)  and  B-Pitman  function  h*  such  that  T*(X)  =  R(h*(X)). 

Op  *\j 

Corollary  4.1,  (a)  There  are  an  infinite  number  of  (mutually  equivalent) 

versions  of  M-S-N. 

(b)  N, (X) ,  of  Section  3,  can  be  epxressed  in  terms  of  at  least 

1  v, 

one  B-Pitman  statistic;  and 

(c)  If  JC  £  ft(SISI),  P{R(h(X))  =  k}  =  f(n!)(2n)]"1  for 
1  £  k  £  (n!) (2n) . 

At  this  point  one  has  the  following  formulations  of  basic  statistical 
structure 


M-S-S 

(i)  S  (X)  =  [  |  X  |  (1) ,  ....  |  X |  (n)  ] 

t  n/ 

(ii)  S*(X)  =  (y(X) :  YES*} 

Z  r\j  %  n 

(i ii)  S_(X)  =  G*. ( • )  (of  Def.  4.3  below) 

3  n 

M^S-N 

(i)  Nj(X)  .=  [eCXp,  e(Xn);  RCjXjl),  R(|Xn|)] 

(ii)  N,  (X)  =  R(h(X)),  where  h(*)  is  a  B-Pitman  function  wrt  S* 
h  'x,  'v  n 

and  fl(SISI). 
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In  order  to  make  use  of  some  standard  nonparametric  procedures,  one 
introduces  the  entities  below. 

Definition  4.3.  Let  F(*)  satisfy,  F(x)  +  F(-x)  =  1  for  all  x. 

(a)  G  (•)  is  defined  by  G  (z)  =  0,  for  z  <  0,  and  =2F(z)  -  1 

r  r 

for  z  >  0. 

-l  n 

(b)  G*(*)  is  defined  by  G*(z)  =  n  J  (z  -  |xj(i)). 

One  proves  easily 

Theorem  4.4.  ST(X)  =  G*(0  is  a  M-S-S. 

- - —  3  %  n 

Further,  it  follows  from  Bi rnbaum  and  Rubin  (1954),  and  Bell  (1964a,  b) , 

that 

Theorem  4.5  .  (PDF  Theorem)  bach  statistic  of  the  form 
<J>[Gp(|x|  (1)),  ...,  Gp ( | X |  (n)) ]  or  r(^*(0) 
is  PDF  wrt  n(SISI). 

This  theorem  will  be  used  in  constructing  all  procedures  for  one  of 
the  signal  detection  models  of  the  sequel. 

One  final  statistical  tool  will  be  introduced  in  this  section.  It  is 
a  slight  generalization  of  the  method  of  Durbin  (1961)  and  Bell  and 
Doksum  (1965),  and  should  be  employed  primarily  to  avoid  certain  distri¬ 
bution  problems.’ 

Let  6 (X)  =  [S (X) ,  N (X)  ]  ,  where  6(*)»  S(X),  and  N(X)  are  res- 

f\j  f\j  'Xj  % 

pectively,  the  BDT,  M-S-S  and  M-S-N  of  a  family  .Q1  of  stochastic 
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process  laws.  Let  X  be  data  governed  by  a  law  (usually  unknown) 


m 


fi',  and  let  Y  be  generated  by  a  law  (known)  in  9'  and 


% 


be  independent  of  X. 

% 


Definition  4.4.  (a)  Y  is  called  E-S-N  (Extraneous  statistical  noise) 

- - — -  % 

(b)  Y 1  =  6~*[S(x),  N (X) ]  is  called  R-S-N  (randomized  statistical 

'Xj  Oj  r\j 

noise) . 

Example  4.3.  Let  X  =  (X^ ,  . . . ,  X^)  be  the  increments  of  data  governed 
by  a  law  in  Sl(SISI).  Let  Y  =  (Y.  ,  Y_c)  be  the  increments 

r  f\,  1  zr> 


of  a  Wiener  process  (See  Section 

2)  with 

a  =  1/4. 

Then  X^ , 

.,  X 

are  i.i.d.  F  ( • ) ,  unknown ;  and 

V 

,  Y25  are 

i.i.d.  $ . 

One 

simple  version  of  the  M-S-S  is 

S.(X)  = 

1 

(|x|(l),  . 

..,  I X | (25) , 

in 

which  case  one  chooses  N  (X)  = 

1  % 

[etxp,  • 

...  e(X25); 

R( 1 Xj | ) ,  .. 

*  J 

R(|X  I)],  and,  then  6(X)  =  [S  (X) ,  N  (X)].  One  then,  forms 

ZD  f\j  L  *\j  ±  *\j 


.-1 


( |  Y  |  Cl)  ,  •••,  |  Y  |  (25) ;  eCXp,  ...,  e(X25);  RCjx^),  ...,  R(|X25|))  = 


(Y*  ...,  Y1  ),  where  Y!  =  [2  e  (X.)  -  1]  |  Y.|  (R(  |X .  | ) ) .  This  means  each 

1  J  3  3  1 

X  is  replaced  by  that  Y,  which  has  the  same  absolute-value  rank  and 
same  sign.  Then,  it  can  be  proved  that  Yj ,  ...,  Y),,.  are  i.i.d. 

N(0,1). 

These  ideas  are  formalized  in  the  theorem  below. 


Theorem  4.6*  (Randomized  Noise  Theorem).  Let  X  =  (X1 ,  ...,  X^)  and 

.  ....  .  i  —  ■  ■  X  II 


Y  =  (Y  ,  . .  .  ,  Y  )  be  independent,  and  be  generated  by  laws  *  and 
a,  1  n 


and  * 

respectively  in  fl'  .  Let  6(-)>  S(X)  and  N(X),  be  respectively,  the 

BDT,  M-S-S  and  M-S-N  for  9' .  If  Y'  =  (Y*  Y')  = 

^  1  n 
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S"1 [S (Y) ,  N (X) ] ,  then  Y'  =  Y  ^  <X  *  e  ft' . 

Thus,  the  distribution  of  the  H-S-N,  Y,  has  been  imposed  on  the 

% 

data  while  the  M-S-N  of  the  data  is  preserved. 

Now,  one  is  in  a  position  to  treat  the  pertinent  inference  problems. 

5 .  Goodness -of- fi t  Detectors 

The  situation  to  be  treated  here  is 

PN:  oT  =  e  HCSISI )  vs  N  +  S  *0- 

One  notes  that  the  law  and  the  sampling  spacing  A,  uniquely 

determine  the  common  distribution  F^,  of  the  independent  symmetric 
increments  =  Z(jA)  -  Z([j-1]A),  for  j  =  1,  2,  ....  n,  where 
Z(0)  =  0. 

Example  5.1.  (a)  In  Example  2.1,  ^  is  such  that  =  N(0,  a  A). 

(b)  In  Example  2.3,  if  A  =  3,  then  F~  is  that  distribution  function 

2-3 

with  characteristic  function  <f>(u)  =  (1  +  t  /X)  .  In  Fxamnle  2.2.,  F^ 

is  the  distribution  function  with  characteristic  function 
<t>(u)  =  exp  { - AA [  1  -  <|>*(u)]},  with  $*(u)  =  Cu6)_1  sin  u0. 

For  a  given  constant  sampling  spacing  A,  one  can  rephrase  the 
detection  problem  as 

PN*:  F  =  Fq  e  ft*  vs  N  +  S:  F  /  F0 

where  FQ(x)  +  FQ(-x)  =  1  for  all  x,  i.e.,  FQ  exhibits  symmetry  wrt  0. 
This  being  the  case,  one  should  base  the  decision  rule  on  the  M-S-S: 
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s1(jy  =  [|x|(D,  . 

G  ,  where  G_(x) 
0 

Hence  (by  Bell 

consider  decisions 

G„  ( | X | (n) )  ]  . 

F0 


.  ,  |x|(n)],  and  a  PDF  statistic  using  the  cpf 
=  2F(x)  -  1,  x  >  0. 

(1964b)  and  Birnbaum  and  Rubin  (1954)), one  should 
based  on  statistics  of  the  form  ij;[Gp  (|x|(l)),  . 


Decision  Rule  5.1. 

where  G*(z)  =  n  * 
n  ' 

to  achieve  FAR, a. 


Decide 

n 


l  e  (z 


j=l 


N  +  S  iff 

-  I  X 1  (j  )  )  , 


sun  G*(z)  -  G  (z)  >  d', 
z  1 0 

and  d'  =  d'(a,n)  chosen  so  as 


Decision  Rule  5.2.  Decide  N  +  S  iff 


[G*(z) 


Gp  (z)]2dGp  (z)  >  w* 
F0  F0 


w*(a,n) . 

Of  course,  several  other  goodness -of-fit  statistics  could  have  been 
used.  [See,  e.g.  Bell  (1964b),  Model  I  J. 

If  one  knows  that  the  signal  has  the  effect  of  yielding  a  specified 
distribution,  H  for  X*  =  (X*  ...,  X*),  where  X*  =  |x|(j),  then 

"w  1  K  J 

one  would  prefer  a  decision  rule  of  the  form  below.  [The  problem  here 
would  be  PN: 


* 


vs  N  +  S : 


k 

Decision  Rule  5.3.  Decide  N  +  S  iff  h(x|,  ...,  x£)  >  C*  it  gp  (xt) , 
where  C*  =  C(a,k)  and  g(-)  and  h(-)  are  the  appropriate  densities. 

F0 

These  three  decision  rules  will  be  illustrated  with  data  in  the 


appendix. 
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6 .  Class -Fit  Detectors . 

Here  one  has  the  detection  problem  PN:  e  P(SISI)  vs  N  +  S: 

c %  £  fi(SISI).  Since  this  problem  does  not  concern  a  specific  value  of 
the  parameter  (or  law),  one  is  led  to  NPDF  statistics  and  M-S-N.  One 
useful  version  is  N  (X)  =  fe(X  ),  ...,e(X  );  R(  |  X  | ) ,  ....  R(|x  |)]. 

i.  1 1  ,  X  11  J.  II 

n 

Decision  Rule  6.1.  Decide  N  +  S  iff  T*(X)  =  T  c  (X.)  >  c  or  <  c  , 

where  c .  =  c . fa, n) . 

.1  .1 

This,  of  course,  is  a  Sign  Detector  or  a  Throshhold  Detector.  It 

makes  no  use  of  the  rank  vector. 

In  order  to  make  use  of  all  of  the  components  o4:  S.(X),  one 

1  % 

introduces  the  Wilcoxon  1-sample  statistic. 


Decision  Rule  6.2.  Decide  N  +  S  iff  W*(X)  =  J  e  (X.)R(|x.l)  <  k, 

or  >  k^,  where  k.  =  k.fa.n). 

-  2  j  3 

The  statistics*  employed  in  the  two  decision  rules  above  involve 

statistics  which  are  well-tabulated.  However,  these  statistics  both 

have  discrete  distributions,  and  it  is  necessary  to  turn  to  randomized 

procedures  if  one  wishes  to  achieve  certain  particular  PFA's, 

One  such  procedure  is  based  on  E-S-N  (extraneous  statistical  noise-)  , 

i.e.,  Y  =  (Y  ,  Y  )  independent  of  the  data  and  generated  by  a 

'h  1  n 

known  law  %  in  ft(SISI)  . 

Let  Yj,  be  i.i.d.  N(0,1).  'IThis  corresponds  to  data 

Z  =  [Z(A),  Z(2A),  Z(nA)]  from  a  WLP  satisfying  y(t)  =  0  and 

2 

o  A  =  1.  When  the  process  { Z (t ) }  is  independent  of  the  data  so  are 
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the  Y!s. 

From  Section  4,  one  knows  that  the  BDT,  6(*),  is  such  that 
6fX)  =  [S  (X) ,  N  (X) ] ,  where  S.  (X)  =  [|x|(l),  ...,  |x|(n)]  and 

i  1  *\j  i  % 

Nj(X)  =  [e(Xp,  ...,  e(Xn);  R(|Xj|)  ,  . . .  ,  R(|xJ)3 .  In  order  to  apply 

the  Randomized  Noise  Theorem  (Theorem  4.6),  one  forms  Y'  =  (Y',  Y')  = 

1  n 

6_1[Si(Y),  NjfX)].  In  this  case  Y!  =  [2  e  (X . ) - 1]|  Y  | (R( | X . I ) ) .  This 

means  N, (X)  =  N,(Y')-  By  Theorem  4.6,  Y! ,  ...,  Y'  are  i.i.d.  N(0,1) 

I  %  I  <v,  A  n 

under  PN:  e  fi(SISI) .  One  useful  decision  rule  based  on  the  Kolmogorov- 

Smirnov  statistic  is  then. 

1  n 

Decision  Rule  6.3.  Decide  N  +  S  iff  sup  |—  /  (z  -  Y!)  -  4>(z) |  >  d', 

z  n  1  J 

where  d'  =  d(a,n). 

Again  one  notes  that  Decision  Pule  6.3.  could  have  employed  any 

goodness -of-fit  statistic.  (See,  e.g..  Bell  (1964a);  Model  I  Detectors)). 

A  different  type  of  detection  procedure  is  developed  if  one  employs 

a  different  version  the  M-S-N. 

Let  S*  (See  Definition  4.1)  be  the  set  of  sign-time  permutations 
n 

of  coordinates  of  X  =  (X  ,  ...,  X  ),  and  let  h(X)  =  7  j  X..  Then  one 

%  1  n  W  i  J 

has  the  following  result. 

Theorem  6.1.  (a)  h(-),  above,  is  a  B-Pitman  function  wrt  H(SISI) 

and  S* .  si 

n  U 

(b)  R(h(X)),  as  in  Definition  4.1,  is  M-S-N.  P 

n, 

(c)  Under  PN,  R(h(X))  'v  D  -  U{l,  2,  ...,  k*},  i.e.,  has  a  discrete 

a. 

uniform  distribution  over  the  integers  { 1 ,  2,  ...,  k*},  where 
k*  =  (n!)(2n). 
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Decision  Rule  6.4.  Decide  N  +  S  iff  R(h(X))  <  b,  or  >  b  . 

- _ -  —  —  l 

where  bj  =  (n!)(2n)(|)  and  b0  =  1  +  (1  -  |)(n!)(2n). 

Of  course,  one  could  construct  an  analagous  decision  rule  for 
each  B~ Pitman  function. 

These  decision  procedures  are  illustrated  in  the  appendix. 

The  final  detection  problem  of  this  paper  concerns  the  equality  of 
two  stochastic  process  laws. 


7.  Two- Sample  Detectors. 

The  detection  problem  considered  here  is 


PN 


:  ^ \  =  ^2  vs  N  +  S:  ^  ^2 


where  e  fi(SISI). 

Consider  the  situation  where  X  =  (X. ,  . . . ,  X  )  is  generated  by 

%  i  m  i 


and  Y  *  (Y  ,  . . . ,  Ym)  is  generated  by 


and  is  independent  of  X. 


n 


Under  PN,  Z  =  (X,Y)  =  (X  ,  X  ;  Y  ,  ...,  Y  )  =  (Z  ,  ...,  Z  ) 

%  0,  1  mi  n  I  IN 

is  distributed  as  a  random  sample  from  an  (unknown)  F(0*  symmetric  wrt  0. 

Since  no  specific  value  of  a  parameter  (or  law)  is  involved,  one  needs 
an  NPDF  statistic.  Such  a  statistic  must  be  based  on  some  version  of 
the  M-S-N,  e.g.,  S^Z)  =  [e(Zj)  ,  .  ..,  (ZN) ;  R(  Zj  ),  R(  ZN  )]. 

n 

Decision  Rule  7.1.  Decide  N  +  S  iff  W  =  £  R(|Z_.|)  ^  b,  or  <_  b0, 

!  i 

where  b.  =  b.(a,m,n). 

.1  3 


Decision  Rule  7.2.  Decide  N  +  S  iff 
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.  m  m 

sup  I  -  l  (z  -  |x|(j))  "  ~  l  (z  -  M  (i)) !  >  d'  =  d(a,m,n) 

z  1  1 

Decision  Rule  7.3.  Decide  N  +  S  iff 

,  ni  ^  n 

-  I  V(R|X.|)  -  -  l  V(R(|Y.|))  >  a.  or  <  a_, 
m  ~  1  j  1  n  J  v 1  i  1  1  2 

where  V,  ,  ....  V  are  i.i.d.  $  and  independent  of  the  data. 

I  n 

This  last  decision  rule  is  based  on  the  Theorem  4.6.  (See  also.  Bell 
and  Doksum  (1965)).  The  first  and  second  rules  above  correspond  to  the 
Wilcoxon  Rank-Sum  statistic,  and  two-sample  Kolmogorov-Smirnov  statistic, 
respectively . 

The  techniques  here,  although  concerned  with  Q(SISI)  are  closely 
related  to  many  of  the  usual  two-sample  non-parametric  detectors.  (See, 
e.g.,  Bell  (1964a;  Model  II  Detectors)). 

Numerical  examples  of  the  decision  rules  here  are  given  in  the 
appendix.  We  conclude  this  section  with  two  more  detectors. 

(A)  A  Modified  Two-Sample  Detector 

The  simplest  rank  detector  (statistic)  for  the  pure  noise  situation 

N 

of  univariate  symmetry  about  zero  is:  T  =  £  e  (Z.)  where  e(y)  =  1 

i=l  1 

if  y  >  0  and  zero  otherwise.  Under  the  null  hypothesis  of  PN,  T  has 
a  Binomial  distribution  with  parameters  N  and  .  However,  it  is  clear 
that  for  any  asymmetric  distribution  with  F(0)  =  ^  ,  the  power  of  any 

test  based  solely  on  T  is  equal  to  its  PFA. 

Let  T  =  k,  and  define  {Z^,  ^}  =  { j  |  :  Z^  <  0}  and 
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{Z*,  .  ..,  Z*}  =  (|z.|:  Z^  >  0}.  There  are  now  two  new  samples  on  which 

one  can  try  any  two-sample  detection  procedure  such  as  the  Kolmogorov- 
Smirnov  detector,  Cramer-von  Mises  detector,  Mann-Whi tney-Wilcoxon 
detector,  Fisher-Yates-Terry  detector  etc.  Each  of  these  tests  is  a 
most  powerful  invariant  detector  against  specific  N  +  S  alternatives, 
see  Lehmann  (1959). 

The  power  of  such  tests  is  equal  to  PFA  for  S  +  N  situations  with 
the  Z  distribution  symmetric  about  zero.  There  is  a  slight  difficulty 
in  applying  detectors  based  on  comparisons  of  {-Z_! }  and  {Z?}  when 
m  or  n  is  small.  For  this  latter  reason,  one  can  introduce  the 
modification  below.  (The  development  below  is  a  slight  extension  of 

some  ideas  in:  Dippo,  C.  (1970) _  Distribution-free  tests  of  univariate 

symmetry  about  zero.  Class  Proiect,  University  of  Michigan.) 

Let  T(m,n)  be  an  arbitrary  two-sample  detector  with  PFA  a  and 

critical  region  C(a;m,n).  Then  let  ^(Z)  =  1  if  T  <  k  or  >  k  ;  =  1 

r\j  l  z 

if  kj  £  T  _<  and  T(m,n)  e  C(a;m,n);  and  zero  otherwise.  The 
power  of  detector  ip  at  =  P{Reject  PN|S  +  N  is  true}  is 

iffdFj  -  P{T  <  kj|F  }  +  P(T  >  k2|F1)  + 
k2 

+  l  P{T  =  j}  P(T(j,N-i)  e  C(a;j,N  -  j)> 
j=kl 

kr1 

=  I 

j=i 
k2  n 

+  l  Cr  P(T(j,N-j)  e  C(a;  j  ,N- j)  }  p]  q^"-1  , 
j=kl 


C.  p 


N-j 


N 

l 

j=k0+l 


N 

C. 

3 


J  N-i 

Pi  <u 
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where  Pj  =  P{Z  <  0 1 }  and  =  1  -  p  . 

This  procedure  eliminates  the  difficulties  when  m  or  n  is  small,  but 
the  power  is  still  a  =  PFA  when  one  has  univariate  symmetry  about  zero  - 
a  desirable  property. 


(B)  A  Chi-Squared  Detector 

Under  the  PN  situation,  divide  the  domain  of  Z^'s  into  sections 

symmetric  about  zero  by  the  set  of  numbers  (a_.  ,  i  =  0,  +1,  + 2 ,  .  +k}. 

The  probability  of  falling  in  the  interval  (a_.  a.)>  i  >  1  under  the 

PN  case  is  the  same  as  falling  in  the  interval  (-a.,  -a.  ^).  The 

a^fs  may  be  chosen  arbitrarily,  be  based  on  some  prior  knowledge,  or, 

better  yet,  be  based  on  the  ordered  statistics. 

Let  Z .  .  ,  Z,...  be  the  order  statistics  of  the  {Iz.l}. 

(1)  (N)  1 

If  the  desired  number  of  intervals  is  k,  let  r.  =  jN/k  and  a.  =  Z, 

3  3  (.r.y 

j  =  1,  2,  ....  k.  Then 
N 


N.  .  =  l  e  (Z. ) [e(a .  -  Z.)  -  e(a.  .  -  Z.)] 
13  i^1  i  3  i  3-1  i 


N0 .  =  T  e  (-Z. ) [e(a.  -  Z.)  -  e(a.  .  -  Z.)]. 
2j  . l  1  i  i  j-l  1 


Let  N,  =  Y  N, .  and  N_  =  I  N,..  Therefore,  under  the  PN 
1-  -ll  2'  “  2i 

J  3 


situation,  the  detector 

l  I 

i  3 


VN2.  2  N 
-]  [  • 


N 


N.  N- 
1*  2- 


is  DF,  and  has  an  asymptotic  null  distribution  of  a  Chi-squared  statistic 
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with  (k  -  1)  degrees  of  freedom. 

8.  Point  Estimation  and  Confidence  Bounds  for  with  X  in  fi(SISI). 

(A)  Point  Estimation 

Recall  the  definitions  of  SISI  and  symmetric  (about  zero)  probability 
laws,  that  is 


ft(SYM)  =  (F(-):  F  is  continuous  and  F(x)  +  F(-x)  =  1  for  all  x} 

ft(SISI)  =  Y.,  ...,  Y  are  i.i.d.  F  e  ft(SYM)  for  all 

i  m 

A  >  0,  where  Y.  =  Z(jA)  -  Z((j  -  1)A),  Z(0)  =  0}. 


Example  8.1.  Let  ,  ...,  X  ,  ...  be  i.i.d.  Cauchy  with  parameter 

(0,0),  C(O,0),  that  is  f(x)  = 


0 


2  2 
TT(0  +  X  ) 


,  -oo  <  x  <  oo;  and  let 


Z6  "  Z3*  Y3  "  Z9  "  V 


Zn  =  £  X^ .  Choose  A  =  3  and  set  Yj  =  Z^,  Y^ 

Then,  for  { Z  :  m  >  l),  (a)  of  e  Q(SISI);  and  (b)  Y. ,  ...,  Y  are 
i.i.d.  C(0,  30)  which  belongs  to  fi(SYM). 

To  estimate  F(x*)  for  x*  >  0,  clearly  a  natural  estimator  is 

-l  m 

F(x*)  =  F  (x*)  =  m"* 1  l  e  (x*  -  X  ). 

j=l  ' 

If  x*  <  0,  use  estimate  1  -  F(-x*).  Obviously, 


m 


E{F  (x*)  }  =  F  (x*)  ,  Var(F(x*)}  =  ■  (x*3  3  =  > 

A 

and  mF(x*)  is  a  Binomial  variable  with  parameters  (m,  F(x*)). 
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An  optimal  estimate  for  F(x*)  can  be  obtained  by  using  only  the 

M-S-S,  S(Z)  =  (|x|(l),  ....  |x|(m)}  or  another,  easier  to  handle, 

^  -1  m 
version  of  it  G*f-)>  where  G*(z)  =  m  J  z  (z  -  |x|(i)). 

m  nr  .L.  ’  1  * 

.1  =  1 

Lemma  8.1.  (a)  P{  |  X ^  |  _<  z}  =  2F(z)  -  1,  z  0  and  zero  otherwise, 

(b)  E(G*(x*)}  =  2F(x*)  -  1; 

(c)  Var{G*(x*) }  -  2[1  -  F(x«)  J  [2F(x») 

v  J  nr  '  m 


On  the  basis  of  the  definition  of  G*(-)  and  the  above  lemma,  define 

m 

another  estimator  of  F(x*)  as 


%  G*(x*)  +  1 

F  (x*)  =  - ^ - 

m  1 


% 


Theorem  8.1.  (a)  F,{F  (x*)}  =  F(x*); 


m 


{l0  Var{F  (x*)}  =  1»  -  rJJ. 

v  1  nr  J  2m 


a. 

(c)  Is  t^ie  UMVU  estimator  for  F(x*) 


a. 


(d)  0  < 


Var(FCx*)> 

Var  F(x*) 


2F(x*)  -  1  ^  1 

2F(x*)  -  2 


Thus,  one  can  do  better  by  using  the  estimator  F(x*)  which  uses  the 
symmetry  property  of  the  underlying  process. 


(B)  Confidence  Bounds 

In  this  section  the  aim  is  to  establish  that  when  one  knows  the  underlying 
distribution  function  is  symmetric  about  zero,  then  one  can  achieve  a 
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confidence  region  having  50%  iess  interval  length  as  compared  to 
the  usual  Kolmogorov-Smirnov-tvpe  confidence  regions.  To  Dut  it  in 
another  way,  it  means  that  with  the  same  confidence  level  (1  -  a")  100% 
of  the  former  confidence  interval  has  half  the  width  than  that  of  the 
latter.  First  we  state  some  relevant  results  which  can  easily  he  proved. 


Theorem  8.2.  (a)  For  Y^,  ...,  Y^  i.i.d.  with  F  e  fl(SYM),  the  M-S-S 

is  S(Y)  =  | |Y|  (1) ,  ...,  | Y | Cm) ] ,  that  is  the  ordered  absolute  values. 

(b)  |  Y.  |  Gj. ,  where 


(c)  F(z)  = 


Gj;fz) 


{2F(z)  -  1.  for 
0,  otherwise 


1 

~  [1  +  Gp(z) ] ,  z  >  0 
\  11  -  Gp  (- z) ]  ,  z  <  0 


z  >  0 


Definition  8.1.  ^(R*)  =  { G :  G  is  continuous,  G(0)  =  0). 

Theorem  8.3.  F  belongs  to  fl(SYM),  if  and  only  if,  Gn  is  a  member 

— - - - — - -  r 

of  the  class  ^(R^)  • 


Definition  8.2.  Let  d(a,m)  be  such  that 


a  =  P  {sup  |H  (z)  -  H(z) |  >  d (a,m) } , 
z 

where  V,  ,  _ _  V  are  i.i.d.  with  distribution  H(*) 

1  m 

-1  m 

continuous;  and  H(z)=m  J  e  (z  -  V 

nr  (j  J 


which  is 
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Let  Y.,  ...,  Y  be  i.i.d.  with  F  E  f2(SYM) .  Then  Iy.I,  ....  |Y  I 
1  m  1  1'  1  m  1 

are  i.i.d.  with  G  e  f2„(R*). 

r  Z  1 

Lemma  8,2.  The  following  statement  holds: 


P{G^(z)  -  d(a,m)  <  Gp(z)  <  G^(z)  +  d(a,m)  for  all  z}  =  1  -  a, 

where  G*(-)  is  the  emnirical  distribution  of  the  IyI's. 
m  •  11 

Lemma  8.3.  (a)  A  consequence  of  the  above  result  and  the  previous 

development  is  that: 

1  +  G*(z)  1  +  G*  (z) 

P{ - - ~  T  d(a,m)  <  F(z)  < - — - - +  ^  d(a,m)  for  all 

=  1  -  a  =  P(A+) ,  say. 

1  -  G*  (y)  1  -  G* (y ) 

(b)  P(A+)  =  1  -  a  =  P  { - y3 - J  d(a»"0  <  p(y)  <  ~  ~~~  + 


z} 


\  d(a,m) 


for  all  y  <  0} 

=  P(A"),  say. 

(c)  P(A+}  =  P(A'}  =  P{A+A  A"}  ■  1  -  a. 


Definition  8.3.  Let  L*(z)  and  L  (z)  be  defined  as  follows 
-  nr  *  m 


l;(z)  = 


\  [1  +  G*U)]  +  j  d(a,m) ,  z  >_  0 


\  [1  -  G*(-z)]  +  yd(a,m),  z  <  0 
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L(z) 

m 


(\  [1  +  G*(z)]  -  \  d(a,m),  z  >  0 

j  [1  -  G*(-z)]  '  Td(a>m1>  z  <  0 


Theorem  8.4.  Let  Y,  ,  Y  be  i.i.d.  with  !•(•)  in  fi(SYM)  and 

— — • — ~ -  1  m 

i  m 

G* (z)  =  m"  l  c  (z  -  | Y | C j ) ) -  Then 
3  =  1 

(a)  P(h  (z)  <  F(z)  <  L*(z),  for  all  z}  =  1  -  a. 

v  ^  m  m 

(b)  L*(z)  "  =  d(a,m)  for  all  z. 

The  above  result  combined  with  the  statement  below  helps  achieve  our 
aim  set  in  the  beginning  of  this  section. 

Theorem  8.5.  Let  V.,  ,  ....  V  be  i.i.d.  with  distribution  H(-)  which 

is  continuous.  Then 

(a)  P{H^(z)  -  d(a,m)  <  H(z)  <  H^(z)  +  d(a,m)  for  all  z}  =  1  -  a 

(b)  [H^(z)  +  d(a,m)]  -  [H^(z)  ~  d(a,m)]  =  2d(a,m)  for  all  z. 

These  results  can  be  presented  in  a  slightly  different  form  when  one 
constructs  the  following  empiric-type  distribution  -  the  empirical 
distribution  of  data: 

- | Y | (m) ,  -  | Y | (m  -  1),  -  |y|(1);  |y|(1),  |y( fm) 

Definition  8.4.  Let  G**(z)  be  as  below: 

— - - -  m 

m  m 

G**  (z)  =  (2m)'1  [  l  e  (z  -  ) Y |  (i ) )  +  l  e  (z  +  |Y|(i))]. 

m  3=1  3=1 
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Lemma  8.4.  With  probability  one, 


(a)  G**(z)  =  \  {1  +  [G*(z)  -  G*(-z)]}, 


for  all  z;  and 


(b)  G**(z) 

m 


(  2 

j  [1  +  <^(z)J,  z  -  0 

ji>-  «;<-*)]•  z  <  o 


Theorem  8.6.  Let  ,  . . . ,  be  i.i.d.  with  F(*)  in  f2(SYM) .  Then, 
p{C**(z)  -  jd(a,m)  <  F(z)  <  G**(z)  +  j  d(a.m)  for  all  z} 

=  1  -  a. 

The  results  of  this  section  "fit  in"  well  with  the  idea  that  in 
constructing  PDF  (parametric  distribution-free)  statistics  for  fi(SISI) 
one  should  use  only  the  M-S-S,  S(Z)  =  [|y|(1),  ...,  |y|(.jo)].  To  use 

*\j 

any  "additional  aspects  of  the  data"  should  be  inefficient. 
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APPENDIX  FOR  flfSISI)  FAMILY 
ILLUSTRATIVE  EXAMPLES  AND  GRAPHS + 


Example  1.  To  illustrate  the  Goodness -of- fit  detection  procedure  we  considered 
the  data  given  by  Dewey  (1963) .  The  graph  of  the  data  is  shown  in  Graph 
1(a).  Figure  (b)  shows  the  power  spectrum  of  smoothed  U.S.A.  immigration 
data  of  Dewey.  By  taking  the  period  1830'  to  1950  we  looked  at 
residuals  of  logarithms  of  immagration  data,  after  removal  of  trend  by 
a  simple  seventeen-point  moving  average  -  values  increased  by  2,  see 


Kendall  (1976,  p.  103).  Let  X.  denote  the  ith  year  residual  value 


for  the  period  considered,  then  set  =  X^  + 


+  x1Q,  w2  xn  + 


+  X 
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...;  W^2  =  Xju  +  ...  +  Xj2q;  here  A  =  10  and  n  =  1,  2,  ...,  12.  This 
process  gives  a  probability  law  which  belongs  to  Q(SISI)  but  is  not 
symmetric  about  zero.  The  aim  is  to  detect:  -  PN:  £  Q(SISI)  (with 

symmetry  around  18)  against  N  +  S:  ,  U  indicates  uniform 

distribution.  The  ordered  W^'s  with  computations  and  decision  rule  are 
presented  below.  . 


+This  Appendix  was  prepared  with  the  aid  of  S-M  Lee  and  A.  Mason  of 
San  Diego  State  University. 


*Dewey,  E.  R.  (1963).  The  18- 2-year  cycle  in  immigration,  U.S.A. , 
1820-1962.  Foundation  for  the  Study  of  cycles,  Inc.,  Pittsburgh, 
PA. 


Table  1 :  U.S.A.  Immigration  Data  1830-1950, 


Ordered  Data 

Computations 

W 

u) 

w ... 

1  (.1) 

W 

(j)  i  -  l 

12  40 

40  12 

18.5484 

-0.3804 

0.4637 

19.1244 

-0.3114 

0.3948 

19.1562 

-0.2289 

0.3122 

19.1853 

-0.1463 

0.2296 

19.3680 

-0.0675 

0.1509 

19.5932 

0.0102 

0.0732 

19.8373 

0.0874 

-0.0041 

20.4530 

0.1553 

-0.0720 

20.7822 

0 . 2304 

-0.1471 

20.8606 

0.3118 

-0.2285 

21.0688 

0.3899 

-0.3066 

22.0959 

0.4476 

-0.3643 

The  detection  procedure  used  is  the  Kolmogorov-Smirnov  statistic 


W  W  .  . 

D-.  -  =  max  {max  > 


12 


1 


12  40  *  40  12 


0.4637 


With  PFA  a  =  .01,  the  critical  value  is  c*22-005  =  ^*449,  see  Conovor 


(1971,  p.397) 


Decision  Rule:  Decide  N  +  S  since  D^  >  ^i2‘005’ 


Example  2.  (Random  Walk) .  To  explain  the  detection  procedures  for  the 
Class-Fit  problem,  30  observations  were  generated  from  a  Cauchy  distribution 


30  - 


with  location  and  scale  parameters,  respectively,  zero  and 
Yj ,  Y^,  ...,  Y^q  are  i.i.d.  r.  vs.  from  C(0,2).  Let 

then  {Z(j)J  is  a  stochastic  process  with  law  in  the  family  H(SISI) . 

Choose  A  =  3,  and  set  =  Z(3),  X2  =  Z(6)  -  Z(3),  ...,  X  Q  = 

Z(30)  -  Z(27).  Clearly,  (Z(j)}  forms  a  random  walk  and  {X.}  is  a 
sample  from  it.  For  the  plot  of  the  data  see  Cranh  2. 

Table  2.  Random  Walk  Data 


2 .  Thus , 

Z(.D  =  l  Yr  , 


Data 


Computati ons 

=  -97.8398 

X. 

i 

e(X.) 

R(|X.  |) 

-105.2880 

15.7933 

1 

6 

-39.1513 

-103.4208 

0 

10 

-8.3701 

-1.7487 

!  0 

1 

-6.9749 

-1.8538 

0 

2 

2.6035 

-7.4049 

0 

4 

0.0598 

59.4836 

1 

9 

-4.5361 

41.7548 

1 

8 

-0.1942 

-2.7977 

0 

3 

2.9335 

-30.4395 

0 

7 

3.1908 
-30.6337 
-31.9554 
-29.7549 
=  -18.9902 

11.6435 

1 

5 

Y  =  0.5143 

Y  =  13.2748 

7  15.7933 

!  14.3173 

14.0243 

-87.6275 

-89.3277 

-90.7272 

-89.3762 

-90.1488 

-94.6573 

-91.2300 

-90.6965 

-96.9435 

Y  =-98.5349 
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To  detect  PN:  e  fl(SISI)  versus  £  £  Q(SISI),  the  procedure  is 


as  given  below. 
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The  Sign  Detector.  One  proceeds  as  follows: 

10 

T*  (X)  =  l  E  (X  )  =  4 
O.  j_2  J 

P{T*q<  1  ;  n  =  10}  =  1  -  P{T*q  >  8;  n  =  10}  =  0.0107 

Therefore,  a  =  0.0214;  and  one  decides  N  +  S,  if  and  only  if, 

T*  (X)  >  8  or  <1.  The  conclusion  is  decide  PN,  with  PFA  =  a  =  .01. 

i  O  f\j  —  — 

The  Wi lcoxon-Si gned-Rank  Detector.  In  this  situation  one  computes 
10 

W*  (X)  =  l  e  (X.)  R(|X. I )  =  28.  We  decide  N  +  S,  if  and  only  if 
iU  ^  j=i  31 

W*  (X)  >  k  or  where 

1U  r\j  1  Z 

k1  =  t(a/2,  n)  =  t(.005,10)  =  52 

k2  =  "V  1')  "  t(a/2,  n)  =  3 

("t"  denotes  "tabulated  value"  obtained  from  Hollander  and  Wolfe  (1973)). 
Since  W*^  is  outside  the  critical  region,  one  decides  PN. 

N(t) 

Example  3.  (Compound  Poisson  Process).  Let  Z(t)  =  /_  T.,  where 

j  =  l  3 

{Tj}  are  i.i.d.  r.  vs.  from  double  exponential  with  parameter  4; 

(N(t)}  is  homogeneous  Poisson  process  with  parameter  0.5;  and  {T. } 
and  {N(t)}  are  independent.  Table  3  below  gives  the  generated  data 
from  this  process.  The  plot  for  jumps  is  shown  is  graoh  3. 
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We  sample  every  three  units,  that  is  the  basic  data  is 
(Z (3) ,  Z (6) ,  Z(30) } .  Let  Xx  =  Z(3),  X2  =  Z(6)  -  Z(3), 

X^q  =  Z(3)  -  Z  (27)  .  The  interest  here  is  to  detect  PN:  £  ST  (S  IS  I ) 

against  N  +  S:  ^ £  S2(SISI).  The  appropriate  detection  statistic  is  the 
Wilcoxon  one-sample  statistic.  The  essential  computations  are  as  follows: 


Compound  Poisson  Data 


j 

Z(3j) 

Xi=Z(3j)- 

Z(3(i-1)) 

pclx.b 

1 

0.4576 

0.4576 

5 

2 

1.3184 

0.8608 

8 

3 

2.1871 

0.8687 

9 

4 

1.7750 

-0.4121 

4 

5 

2.7044 

0.9294 

10 

6 

3.0626 

0.3582 

3 

7 

3.1917 

0.1291 

1 

8 

2.7303 

-0.4614 

6 

9 

2.4915 

-0.2388 

2 

10 

3.0675 

0.5760 

7 

12 

Decision  Rule.  Since  W*2  =  J  e  (X^ ) R ( |  X_.  | )  =  43  does  not  fall  in  the 


critical  region 


{W*  ^52 


or 


"  h  i 3> 


with  P FA  a  =  .01, 


we  decide  PN. 
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Table  3.  Compound  Poisson  Process  Data 


Interarrival  Times 

"Jumps11 

Cumulative  Jumps 

1.0895 

0.1250 

0.1250 

0.1005 

0.5383 

0.6633 

0.4816 

-0.2057 

0.4576 

2.4213 

-0.0637 

0.3939 

1.5793 

-0.0065 

0.3874 

10.2320 

|  0.9310 

1.3184 

2.5747 

0.3350 

1.6534 

2.3683 

0.0680 

117214 

0.7450 

0.4657 

2.1871 

1.9194 

-0.1040 

2.0831 

4.0348 

^•0 .2088 

1.8743 

0.3678 

-0.0993 

1.7750 

1.0791 

0.0779 

1.8529 

4.6253 

0.9813 

2.8342 

2.5675 

!  -0.1298 

2.7044 

6.9561 

-0.0153 

2.6891 

4.9539 

0.1758 

2.8649 

0.0201 

0.1977 

3.0626 

0.0424 

0.0324 

3.0950 

0.7306 

-0.1454 

2.9496 

0.1366 

0.2421 

3.1917 

3.1023 

-0.2848 

2.9069 

4.0651 

-0.2780 

2.6289 

0.2947 

1  -0.1014 

2.7303 

0.3993 

-0.0002 

2.7301 

1.2287 

-0.3191 

2.4110 

7.9266 

0.0805 

2.4915 

2.3167 

0.3073 

1  2.7988 

0.5358 

0.0422 

2.8410 

0.1195 

0.2265 

3.0675 
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Example  4  (Wiener-Levy  Process).  Let  W(t)  =  y(t)  +  a  W*(t),  where 

(W*(t),  t  0}  is  a  Wiener-Levy  process  (WLP)  satisfying  (i) 

EW*(t)  =  0;  (ii)  Cov{W*(s),  W*(t)}  =  min  (s,t);  (iii)  it  is  Gaussian. 

Let  Z  =  W(jA),  that,  is  one  samples  at  times  A,  2A,  .  ..,  and 

X  =  (X  ...,  X  )  where  X.  =  Z.  -  Z  and  Z  =  0.  If  y(t)  =  0, 

%  1  m  J  j  i-l  0  v  1  ’ 

the  family  of  such  processes  is  denoted  by  fi(WLPd>);  and  if  p(t)  =  t, 
then  it  is  denoted  by  fi(WLPL) . 

In  this  example  we  illustrate  the  2-samplc  detection  procedure  for 
PN:  X j  =  c  fi(STSI)  against  N  +  S:  £  /  e  fi(SISI) 

for  i  =  1 ,  2 . 

For  WLP(f>,  we  chose  (y,a,A)  =  (0,  5,  0.1)  and  generated  the  first 
set  of  data  (X^,  ...,  X  )  (m  =  15)  as  follows: 

(1)  Generate  30  N(0,1)  observations. 


(i) 

Generate 

91 

J  »  •  • 

,  0 15  11(0,  2tt) 

observations , 

(ii) 

Generate 

Rl> 

•  *  *  ) 

Rj5  Exp.  (1/2) 

observati ons . 

(2) 

Let 

X*  =  R 

2m- 1  m 

COS 

9m> 

X*  =  R  sin  0  . 
mm  m 

(3) 

X!  = 

3 

5f2\  X*  gives 

3 

xr 

....  xio  i.i.d. 

r.  vs.  from  N(0,  25A) 

(4) 

Y  t  - 
*1 

Z(A),  X^  = 

=  Z(2A)  - 

ZtA).  .... 

=  Z  (30A)  -  Z(29 A)  gives 

Z(jA)  =  l  X! ,  i  =  1 ,  2,  . . . ,  30 .  This  provides  first-sample 
i=l  1 

(Xj,  X2,  ...,  X15).  By  choosing  (p(t),a,A)  =  (3t,  5, A  =  0.5), 
we  generate  similarly  the  second  sample  of  observations  (fromWLPL)  which 
is  denoted  by  (Y^ ,  . . .  ,  Y15)  •  Thus  the  BDT  is  Z  =  (Xj,  ....  X 
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Y^,  .  ..,  Y^.).  The  probability  laws  for  X  and  Y,  respectively ,  are  denoted 
by  and  o£ Hie  plots  of  (X!)  and  {Z(jA}}  for  both  processes 

are  shown  in  graphs  4  to  7.  The  Wi lcoxon  Rank-Sum  Detector  is  given 
below;  and  the  Kolmogorov- Smirnov  Detector  is  presented  in  Table  4. 

(A)  Wi lcoxon  Rank-Sum  Detector.  The  data  is  as  given  in  the  first  two 
columns  of  Table  4.  By  using  {X . }  we  compute: 

{R(|X  |),  j  =  1,2,  ...,  15}  =  {1,2,3,4,6,8,10,11,12,14,15,16,17,  18,  21> 

15 

Detector:  W  =  l  R(|x. |)  =  158. 

j=l  3 

Using  a  large  sample  approximation  for  m  =  n  =  15,  one  gets 

W*  =  =  -3'09’  W*  ^  asymptotically. 

Decision  Rule:  Since  W*  <  Z  =  -1.96,  we  conclude  N  +  S. 

-  a/2  * 

The  conclusion  is  still  the  same  if  one  chooses  PFA  a  =  .01. 
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(B)  Two-Sample  Kolmogorov~Smi rnov  Detector  for  Wiener  Levy  Data 


Table  4 .  Computati ons 


Ordered  Combined  Data:  |z|(i) 

|US)_1 

15  , 

I  e(z-|x|(j)H15)  £e(z-|Y|(j))| 

1  ^ 

|x|  (j) 

Mm 

j-1 

0.0680 

. 

1/15 

0.1644 

- 

2/15 

0.2135 

- 

3/15 

0.4502 

- 

4/15 

0.499 

3/15 

0.6046 

- 

4/15 

m  =  n  =  IS 

- 

0.7744 

3/15 

0.9073 

4/15 

0  =  sun  1 F  (z) -G  (z) 1 

m,n  1  n v  J  1 

- 

0.9714 

3/15 

1.1185 

__  1 

|  4/15 

z 

1.6135 

5/15 

=  10/15 

1.6879 

- 

|  ' 

6/15 

- 

1.6923 

5/15 

dm,n;  .025  =  7/15  CPFA=.0S) 

1.9042 

6/15 

2.0984 

- 

7/15 

2.2764 

2.2999 

8/15 

9/15 

dm,n;  .005  '  8/15  <PW=-0I> 

2.4765 

- 

10/15 

[Critical  values  are  from 

- 

2.6325 

9/15 

Conovor  (1971  ,  p.  399) 

- 

2.8479 

8/15 

3.285 

- 

9/15 

- 

3.4773 

8/15 

Decision  Rule: 

- 

4.0665 

7/15 

- 

4.3455 

6/15 

Decide  N  +  S  since 

- 

4.3518 

5/15 

: 

6.6080 

7.2577 

4/15 

3/15 

D  >  d  g  (a= . 01) . 

m,n  m,n;a/2 

- 

8.0898 

2/15 

- 

8.1427 

1/15 

8.5778 

0 

